MPP-2005-122 
LMU-ASC 68/05 

hep-th/0510170 



o ■ One m a Billion: MSSM-like D-Brane Statistics 
o 

(N 

, Florian Gmeiner^'* 

Q ■ Ralph Blumenhagen^ , Gabriele Honecker^, Dieter Liist^'*, Timo Weigand^'* 
(N 

en . ^ Max- Planck- Institut fur Physik, Fdhringer Ring 6, 

^ \ 80805 Miinchen, Germany 

', * Arnold- Sommerf eld- Center for Theoretical Physics, Department fiir Physik, 
Ludwig-Maximilians-Universitdt Miinchen, Theresienstrafie 37, 80333 Miinchen, 

Germany 

in 

O ■ flo, blumenha, gabriele, luest, weigandOmppmu.mpg.de 



Oh! 



X 



Abstract 

Continuing our recent work"hep-th/0411173', we study the statistics of four- 
dimensional, supersymmetric intersecting D-brane models in a toroidal 
orientifold background. We have performed a vast computer survey of 
solutions to the stringy consistency conditions and present their statisti- 
cal implications with special emphasis on the frequency of Standard Model 
features. Among the topics we discuss are the implications of the K-theory 
constraints, statistical correlations among physical quantities and an inves- 
tigation of the various statistical suppression factors arising once certain 
Standard Model features are required. We estimate the frequency of an 
MSSM like gauge group with three generations to be one in a billion. 
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1 Introduction 

The identification of MSSM-like string vacua is of obvious importance. Meth- 
ods have been developed to study compactifications in various corners of the 
string respectively M-theory moduli space. The classes of models that have been 
discussed in most detail are certainly Eg x Eg heterotic string compactifications 
(e.g. [1-3]) and Type II orientifold models with intersecting/magnetised D-branes 
(for references we refer to the most recent review [4]). 

Despite the enormous effort put into this study and the unquestionable suc- 
cesses in understanding the structure of these string models, we are still lacking 
a single fully realistic candidate. Models have been found which naturally give 
rise to certain features of the Standard Model, but all promising candidates have 
failed to be realistic at a certain step. Before getting too desperate about these 
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shortcomings, though, one should keep in mind that the current search is re- 
stricted to very special corners of the overall configuration space, namely those 
which arc technically under good control such as toroidal orbifolds or Gcpner 
model orientifolds [5-8]. A scan of all possible models is still far beyond the 
present state of the art. Therefore, the reason why no perfect model has been 
found yet might simply be that there are too many string vacua and that the, 
say, one million models encountered so far are only the tip of the iceberg of the 
enormous plethora of possible string vacua. 

The existence of a very large number of string vacua has found convincing 
support recently by the study of flux vacua (for references see the review [9]). 
The fluxes induce a superpotential which allows to freeze the former moduli 
flelds related to the size and shape of the underlying geometry. A rough estimate 
of the number of stable minima gave that there might exist of the order of lO^^*^ 
string vacua. During the last two years, this huge number has triggered new 
ideas both on flne tuning problems in particle physics and cosmology and on the 
right approach towards the problem of identifying realistic string vacua. In [10] it 
was advocated that, complementary to the model by model search, a statistical 
approach to the string vacuum problem might be worthwhile to pursue. In fact 
such an analysis could reveal insights into the space of string vacua which might 
eventually provide some hints as to in which corner one should look for realistic 
models or may at least give us an estimate of the abundance of Standard-like 
models. We might even have to face the prospect that such a statistical analysis 
is the best we can ever do. 

There exists a still growing collection of work dealing with the statistics of 
both Type II and M-theory flux compactifications. In these studies the statis- 
tical analysis is mostly concerned with the gravitational (closed string) sector 
of the vacua, see e.g. [11-23].-'^ Comparatively httle is known about the statis- 
tics in the gauge sector of the theory [8,26-31]. Of course these two sectors are 
not unrelated, as, like in Type II, many open string couplings also depend on 
the closed string moduli and therefore on the fluxes freezing them. Even more 
drastically, the closed sector back-reacts on the open string sector giving rise to 
supersymmetry breaking and induced soft-terms on the branes [32-36] . These are 
all phenomenologically very important and string theoretically very involved is- 
sues, which in a really complete statistical analysis have to be taken into account. 
However, practically such a thorough analysis is beyond our understanding of the 
theory and therefore has to wait until these points are better and more generally 
understood. 

As in our flrst analysis [28] , in this paper we follow a more modest approach 
and try to investigate for a quite well understood concrete example the statistical 
distributions of some of the main quantities of the gauge theory sector like the 
rank of the gauge group or the number of generations. More concretely, we study 

^Criticism of the landscape idea is expressed in [24,25]. 
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supersymmetric intersecting D-brane models on the Z2XZ2 o^isi^^ifold background, 
which has enjoyed great interest in the past (see e.g. [37-41] among many others). 
We consider the ensemble of solutions to the tadpole cancellation conditions using 
a special class of supersymmetric, so-called 'factorizable', D-branes. 

In general there will appear a D-term potential which contains scalar fields 
charged under the U{1) gauge symmetry and Fayet-Iliopoulos terms depending 
on one half of the closed string moduli fields (complex structure moduli for Type 
IIA and Kahler moduli for T-dual Type IIB models). Therefore, at this level one 
expects to get a moduli space of vacua, parameterised by combinations of closed 
and open string scalars and containing regions with different gauge symmetry 
and chiral matter content. As a result, it is not so clear what one should actually 
count as a string vacuum. Our attitude is that we are not really counting different 
unconnected string moduli spaces but we are counting regions in the moduli space 
with different gauge groups. 

Turning on a charged open string modulus corresponds geometrically to a 
recombination process of branes and therefore leads in general to curved objects 
outside our class of flat factorizable branes. On the other hand the inclusion 
of supersymmetric three-form fluxes (in the Type IIB picture) might lead to yU- 
terms for some of the charged non-chiral fields and therefore freeze some of the 
open string moduli. It seems that the best we can do at the moment is to count 
solutions of the tadpole cancellation and K-theory conditions in the restricted set 
of supersymmetric branes one can actually describe. Since we are doing statistics 
we are confident that the distributions we find are relatively stable against the 
inclusion of a larger set of branes. 

In [28] we have mainly used the saddle point method to derive the distributions 
and correlations of various physical quantities and for eight- and six-dimensional 
toy models we could confirm this technique against a brute force computer search. 
For the most interesting four- dimensional models the saddle point method also 
became more involved and time consuming and at the same time a brute force 
computer search could not be carried out in, say, a couple of weeks. Moreover, 
in [28] we had not taken the additional K-theory constraints [42] into account. 

In this paper we report on the results of an extensive computer search involv- 
ing several computer clusters looking for supersymmetric models which satisfy 
both the tadpole and the K-theory constraints. Using this large set of data we 
will statistically analyse the question: Among all these models, what is the per- 
centage satisfying certain Standard Model requirements, like Standard Model or 
Pati-Salam gauge group, a massless hypercharge and three fermion generations? 

The very special framework of open string models on which our analysis is 
based immediately raises a crucial question: How generic are our results in the 
complete moduli space of string theory? A trustworthy answer of this question 
can only be given by comparing the analysis of this article to analogous statistical 
results in corresponding corners of the landscape. In view of the current status in 
the systematic exploration of string vacua, progress seems difficult to make. As 
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a very first step into this direction, we attempt a comparison of our large volume 
statistics and the small radius Gepner models of [8] wherever the two sets of data 
are compatible. In particular we distinguish between topological and geometrical 
observables and analyse how their distributions differ (or not). 

The paper is organized as follows: Section El reviews the well known proper- 
ties of the class of models we are considering. We give details about the tadpole, 
supersymmetry and K-theory constraints that arise in general and about the 
additional conditions we get from requiring Standard Model-like properties. In 
section IHl we explain the implementation of the problem in algorithmic form, 
suitable for a systematic computer analysis. Furthermore we address the issue 
of finiteness of the number of solutions. Section 0] contains the results of our 
computer search in form of statistical distributions, while in section El we anal- 
yse these distributions focusing on the correlation of variables. In section IHl we 
summarize our conclusions and give an outlook to further directions of research. 

2 Model building ingredients 

2.1 Parameterisation of Type II orientifolds 

Orientifolds with magnetised D-branes are T-dual to orientifold models of inter- 
secting D-branes. Although the inclusion of three-form fluxes is better understood 
in the Type IIB picture, the general models will be presented in the Type IIA 
picture with intersecting D6-branes, where many quantities have a geometrical 
interpretation at least for vanishing three-form flux. This is the case on which 
we concentrate in this work. 

In the Type IIA language the orientifold action is taken to be fla, where a is 
an anti-holomorphic involution on the compact six-dimensional space preserving 
three-cycles Hoe- The R-R charge of the 06-planes has to be cancelled by a 
suitable set of stacks of Na D6-branes wrapping the three-cycles Ua and their fla 
images 11^. The R-R tadpole cancellation condition can be written simply as 



The homology group H^{Ai) can be decomposed into its Qa even and odd 
parts, HsiM) = H^iM) © H^{M). If «/ G Ht{M) and Pi G H^{M) form 
a symplectic basis, i.e. aj o Pj = Sjj, aj o aj = /?/ o /3j = with I, J & 
{1, . . . , /i2,i + 1}, any cycle can be expanded as 



k 




(1) 



a=l 



Ua = Xad + Yap, K = ^ad - YaP, Hoe = - 




(2) 



Here XqcJ = ^ 



/12,1 + 1 
1=1 



X[ai and , Y^ are integer valued expansion coefficients. 
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The supersymmetry conditions read 

YaF{U) = 0, XalJ > 0, (3) 

where Uj = J^^ Q3 are the complex structure moduh and Fj = J^^ fl^. 
The intersection number between two cycles is given by 

lab = O n, = XaY, - X,Ya. (4) 

The general chiral spectrum is computed from the intersection numbers as shown 
in table m 



reps. 


multiplicity 


reps. 


multiplicity 


(N,,Nfc) 
(N,,N,) 


lab 
lab' 


Antia 


^{laa' — laOe) 
^{laa' + laOe) 



Table 1: Multiplicities of the chiral spectrum from the intersection numbers. 



2.2 Gauge anomalies and K-theory constraints 

Using table HI and (jlj) one can compute that the cubic SU{Na) gauge anomalies 
vanish if the tadpole cancellation condition is satisfied. 

Mixed abelian anomalies on the other hand do occur. The mixed gauge 
anomaly, for example, is of the form 

-^C/(l)a-SC/(Af(,)2 ~ Na{Iab + Iab')C2{Nb) 

= -2NaYaXbC2{N^) (5) 

up to terms which vanish upon tadpole cancellation. C2{Ni,) denotes the quadratic 
Casimir operator of the fundamental representation of SU{Niy). 
A massless linear combination of abelian factors 

U massless 

a 

exists for 

Y,^aNaYa = 0, (7) 

a 

as can be easily seen from More details can be found in [43]. 

Although the tadpole cancellation condition ensures the absence of cubic 
non-abelian gauge anomalies and mixed and cubic abelian gauge anomalies are 
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cancelled by a generalized Green-Schwarz mechanism, there exists one further 
model building constraint [42] due to a Z2-valued conserved quantity, the K- 
theory charge. This quantity can be seen by introducing a probe Sp{2) ~ SU{2) 
brane. If the K-theory charge conservation is violated, there occurs a global gauge 
anomaly. This anomaly is manifest as the existence of an odd number of chiral 
fermions transforming in the fundamental representation of Sp{2) [44]. 

2.3 The TV (^2 x Z2) model 

Factorizable three-cycles in a toroidal background are parameterised by their 
wrapping number (rii^rrii), i G {1,2,3}, along the basic one-cycles (7r2i-i, 7r2i) of 
Y[i=i "^i- Two different shapes per two-torus Tf respect the Q,a symmetry — > 
and they are parameterised by &i e {0, |}, 

Qct : 7r2i_i 7r2i_i - 26j 7r2i, Q-a : ir^i -t^ii- (8) 
It is convenient to work with the Via even combination 

7f2i-l — 7r2i-l — &j7r2i, (9) 

because this modification does not change the computation of basic cycle inter- 
section numbers (although only 1x21-1 lies in the torus lattice). 

The effective overall wrapping numbers of the three-cycles are conveniently 
parameterised by the integers 

= 677,1^2^3, Yq = hfhifh2'fh-i, (10) 
X'^ — —bnirhjrhk, Y^ = —hrhinjUki i, j, /c G {1, 2, 3} cyclic, 

where rhi — rrii -\- h rii is the effective wrapping number along 7f2i-i and b — 
[(1 — bi){l — b2){l — ^s)]"^- Observe that as in [28] we have included a minus 
sign in the definition of Xj and Yi for i = 1,2, 3. Furthermore due to the overall 
scaling factor 6, which we introduce to obtain integer valued quantities also for 
tilted tori, the intersection numbers are computed from 

Iab^b-^(jCafb-ib^a). (H) 

In terms of these quantities, the supersymmetry condition reads 

3 3 

E^'^ = 0' E^'t^/>0, (12) 
7=0 ^ 7=0 

where the Uj are defined in terms of the torus radii as 

Uo = RPR'i^Rf\ Ui = RfB!i^Rf\ i,j, k e {1, 2, 3} cyclic. (13) 
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The tadpole cancellation condition is given by 

a 

E^'^^^ = {1,2,3}, (14) 

a * 

i.e. we have L = L (6, (1 — with L = 8. The effective wrapping numbers 

can easily be rewritten in the original unsealed wrapping numbers along the basic 
three-cycles via the relations {i,j,k G {1,2,3} cychc) 

(3 3 
F° + 5^ h,X' - J2 bjbkY' - bihhX"^ 
1=1 i=l / 

Y' = miUjUk = b-^ (~Y' - biX^) . (15) 



These quantities are needed to implement the coprime condition on the sets 
{rii, rrii) of the wrapping numbers per two-torus, 

3 

{Y,f = \{gcd{Yo,X,). (16) 

i=l 

However, for computational convenience we will work with the rescaled wrapping 
numbers fllU|) . which fulfill the same multiplicative relations as the original ones, 

X^Y^ = X-^Y^ for all /, J G {0, . . . , 3}, (17) 
X'X^ = -Y^Y^, X^{Y^f = -X^X^X^, Y^{X^f = -Y^Y-^Y^ , 

where on the second line /, J, K, L are permutations of {0, ... , 3}. 

The number of chiral (anti) symmetric representations of some U{Na) factor 
is given in table ^ If one imposes for phenomenological reasons the constraint 
#(Sym^) = 0, this leads to #(Antia) = —-^X^Y^ and one can distinguish two 
cases: 

1. #(Antia) = #(Sym^) = 0, where for some permutation {I,J,K,L) of 
(0, . . . , 3) we have the relations on supersymmetric factorizable branes 

Y^L^ + Y^L^ = 0, L^U^ = L^U^. (18) 

These are the types of branes occurring also in compactifications to six 
dimensions as explained in [28]. 
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2. #(Antia) ^ 0, #(Sym„) = 0: in this case ^ for all / G {0, . . . , 3} and 
the constraint on the vanishing net number of chiral symmetric representa- 
tions can be rephrased as 

Ei^^ = 16, (19) 

7=0 

while ^ has to be fulfilled. 

The D-brane configurations discussed so far support U{Na) gauge factors. In 
addition, there exist four different kinds of Sp{Nb) gauge factors, each of them 
descending from branes wrapping the VLaO^uo^ {k, I G {0, 1}) invariant planes, 
where 6 and u are the two orbifold generators. The wrapping numbers of these 
branes are k G {1, 2, 3} cyclic) 

X° = 6^ X^ = X^ = X^ = or X' = — X° = X^ = X^ = 0, 

1-bi 

= for all/. (20) 

The K-theory constraints on consistent chiral spectra demand that there must 
be an even number of chiral fermions transforming in the fundamental represen- 
tation of any possible Sp{2) factor. Inserting the wrapping numbers of all four 
5*^(2) candidates into (fTTjl and summing over all U{Na) factors, the K-theory 
constraints take the form 

Y,Nat' e 2Z, 

a 

{l-b,){l-bk)J2NaY: G 2Z, i,j,A;G {1,2,3} cyclic. (21) 



2.4 Standard Model realisations 

For a Standard Model like sector of x quark and lepton generations in a four 
stack intersecting D-brane model, the gauge group has to contain one of the two 
following factors 

1. U{3)a X Sp{2), X f/(l), X f/(l), 

2. f/(3), X f/(2), X f/(l), X f/(l), 

with 7^(Sym„) = ^^(Symf,) = in order to have no exotic symmetric chiral matter 
of the two non-abelian factors. The first stack is therefore of the type (fTH|l or p9|l . 
the second can also be of the form ()20|) . In some cases, the Standard Model 
quantum numbers can also be realised on three stacks only at the cost of having 
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particle 




mult. 


U(S)a 


X Sp{2)t, X C/(l), X U{l)a with Q^?^ 


Ql 


(3,2)0,0 


lab 


ur 


(3,l)_i,o+(3,l)o,-i 


la'c + la'd 


dR 


(3, 1)1,0 + (3, 1)0,1 


la'c' + la'd' 


dn 


(3a, 1)0,0 


\{Iaa' + laOo) 


L 


(l,2)_i,o + (1,2)0,-1 


he + hd 




(1,1)2,0 


\{Iec' ~ hoe) 


eR 


(1,1)0,2 


^{hd' — hoe) 


eR 


(1,1)1,1 


led' 




X U{2\ X C/(l), X U(l)d with Q^f^ 


Ql 


(3,2)0,0 


hb 


Ql 


(3,2)0,0 


hb' 


Ur 


(3,l)_i,o+(3,l)o,-i 


h'e + h'd 


dR 


(3, 1)1,0 + (3, 1)0,1 


h'c' + h'd' 


<Ju 


(3.1. l)oi, 


^i-Lia' + LiOg) 


L 


(l,2)_i,o+ (1,2)0,-1 


he + hd 


L 


(l,2)-i,o + (1,2)0,-1 


h'c + h'd 


eR 


(1, 1)2,0 


^{hc' — hoe) 


eR 


(1, 1)0,2 


^{hd' — hoe) 


eR 


(1,1)1,1 


led' 



Table 2: Realisation of quarks and leptons for various hyper charges. Part 1. 



no standard Yukawa couplings (which are not always realistic in the four stack 
models either). 

Up to an interchange of cycles and their Qa images, there exist three different 
possible definitions of the hypercharge for quarks and left-handed leptons: 

1. The 'standard' definition Q^^ — ^Qa + ^Qc + ^Qd, vahd for both choices of 
an Sp{2)b or C/(2)b factor. This hypercharge is massless \IY a-\-Y c-\-Y ^ — 0. 
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particle 




mult. 


U{3)a X U{2), X C/(l), X U{1) 


d with Qy^ 


Ql 


(3,2)0,0 


lab 


Ur 


(3a, 1)0,0 


\{Iaa' + IaO%) 


dn 


(3, l)_i,o + (3, 1)0,-1 




la'c + la'd 


dR 


(3, 1)1,0 + (3, 1)0,1 




la'c' + la'd' 


L 


(l,2)_i,o + (1,2)0,-1 




he + hd 


L 


(1,2)1,0 + (1,2)0,1 




hd + hd' 


eR 


(1, 1a)o,o 




-\{hb' + hoe) 


U{3)a X U{2), X U{1), X U{1) 


d with 


Ql 


(3,2)0,0 


lab 


Ur 


(3a, 1)0,0 


\{,Iaa' + laOo) 


dR 


(3, l)_i,o 




h'c 


dR 


(3,1)1,0 




h'd 


L 


(1,2)0,-1 


h'd 


eR 


(1, 1a)o,o 




-\{hv + ^oe) 


eR 


(1,1)1,1 




hd' 


eR 


(l,l)-i,i 




h'd' 



Table 3: Realisation of quarks and leptons for various hyper charges. Part 2. 



2. The first 'non-standard' definition Qy = —-^Qa — 2^6, where right-handed 
up-type quarks are realised as antisymmetric representations of U{3)a [45, 

46]. In order for this hypercharge candidate to be massless, Ya + Yb — 
has to be satisfied. 

3. The second 'non-standard' definition Qy^ = —^Qa — ^Qb + Qd-, where again 

right-handed up-type quarks are realised as antisymmetric representations 
-* -* -* 

of U{3)a- Ya + Yb — Yd — is the condition for this abehan factor to remain 
massless. 

The 'standard' definition of the hypercharge on four stacks admits the reahsa- 
tion of all Standard Model particles as bifundamental representations, thereby 
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ensuring the existence of Yukawa couplings from triangles of three intersecting 
branes. This case can be reduced to a three stack model by simply replacing 

|(Qc + Qd) \Qc- 

The first 'non-standard' definition also has a direct reduction to three stacks 
since Qc and Qd do not appear in the definition of Qy^- other hand, the 

second 'non-standard' definition really requires four stacks of branes. 

The corresponding realisations of the Standard Model quarks and leptons are 
given in tables |21 and El Apart from the listed particles, there exist candidates 
for right handed neutrinos and Higgs multiplets for each possible hypercharge. 

Having xq generations of quarks amounts for the first case to setting 

XQ = lab = la'c + la'd = la'd + la'd' + Iji^aa' + laOo) (22) 

and xl lepton generations occur for 

XL = he + hd = 2^hc' — IcOg) + 2^hd' — IdOo) + hd'- (23) 

For the remaining cases, the analogous formulae can be read off from tables El 
and 01 



3 Computer algorithms 

We have defined the wrapping numbers and as integer valued quantities in 
order to implement the supersymmetry ()12j) and tadpole ()14j) conditions in a fast 
computer algorithm. From the equations we can derive the following inequalities 

3 3 

Q <^X^Ui <^UUi. (24) 

7=0 7=0 

In a first step possible values for the wrapping numbers and are com- 
puted using for a fixed set of complex structures. 

In a second step we use the tadpole equations ((HJ, which after summation 
can be reduced to 

^Sa = C with Sa = ^NaUiXl and C = ^UUi. (25) 

all 

The algorithm to find solutions to these equations computes all possible par- 
titions of C and factorises them into possible values for Na and X^, taking only 
factorizations into account which match the values generated in the first step. 
The results obtained in this way have to be checked again if they satisfy all 
consistency conditions, especially the K-theory constraints. 
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3.1 Number of solutions 



It is an important question whether or not the number of solutions is infinite, 
because a statistical statement can only be of significance if we are dealing with 
a finite set of models. Unfortunately we have not been able to give a complete 
analytic proof of finiteness of the considered space of models, but we have good 
numerical hints towards this assumption. 

As explained in [28], it is sufficient to analyse the possible configurations of 
wrapping numbers for models in which all are non-vanishing. The tadpole 
equations can then be shown to admit only a finite number of supersymmetric 
configurations for a fixed choice of rational complex structures and fixed L. The 
difficult question is which complex structures are compatible with the consistency 
conditions. For toroidal compactifications to six dimensions it is possible to 
give an upper bound for the complex structures in terms of L. In the four- 
dimensional case, however, it is not immediately clear how to achieve this. Figure 
Eshows how the total number of mutually different brane configurations for L = 2 
increases and saturates, as we include more and more combinations of values for 
the complex structures Uj into the set for which we construct solutions. For this 
small toy value of L our algorithm actually admits pushing the computations up 
to those complex structures where obviously no additional brane solutions exist. 

For the physically relevant case of L = 8 the total number of models compared 
to the absolute value |f/| of the complex structure variables scales as displayed 
in figure El The plot shows all complex structures we have actually been able to 
analyse systematically. We find that the number of solutions falls logarithmically 
for increasing values of |f/|. In order to interpret this result, we observe that the 
complex structure moduli Uj are only defined up to an overall rescaling by the 
volume modulus of the compact space. We have chosen all radii and thereby also 
all Ui to be integer valued, which means that large \ U\ correspond to large coprime 
values of R^^^ and i?2*'*- This comprises on the one hand decompactification limits 
which have to be discarded in any case for phenomenological reasons, but on 
the other hand also tori which are slightly distorted from e.g. square tori for 
Rf/nf = 0.99. 

Combining the results of the two numerical tests we have reason to hope that 
we can indeed make a convincing statistical statement using the analysed data. 
Nevertheless, at this point it should be mentioned that we cannot fully exclude 
that a large number of new solutions appears at those values for the complex 
structures which we have not analysed. 

3.2 Complexity 

The main problem of the algorithm used to compute the models lies in the fact 
that its complexity scales exponentially with the complex structure parameters. 
Therefore we are not able to compute up to arbitrarily high values for the Uj. 
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Although we tried our best, it may of course be possible to improve the algorithm 
in many ways, but unfortunately the exponential behaviour cannot be cured 
unless we might have access to a quantum computer. This is due to the fact that 
the problem of finding solutions to the diophantine equations we are considering 
falls in the class of NP complete problems [47] , which means that they cannot be 
reduced to problems which are solvable in polynomial time. In fact, this is quite a 
severe issue since the diophantine structure of the tadpole equations encountered 
here is not at all exceptional but very generic for the topological constraints also 
in other types of string constructions. 

For the total number of models analysed in this paper, which is around 
1.66 X 10®, we used several computer clusters built out of standard PCs with 
an approximative total CPU time of roughly 4 x 10^ hours. To estimate how 
many models could be computed in principle using a computer grid equipped 
with contemporary technology in a reasonable amount of time, the exponential 
behaviour of the problem has to be taken into account. Let us be optimistic 
and imagine that we would have a total number of 10^ processors at our disposal 
which are twice as fast as the ones we have been using. Expanding our analy- 
sis to cover a range of complex structures which is twice as large as the one we 
considered would in a very rough estimate still take us of the order of 500 years. 

Note that in principle there can be a big difference in the estimated computing 
time for the two computational problems of finding all string vacua in a certain 
class and of singling out special realistic ones. Apparently, for the second ques- 
tion, having Standard Model features realized by just a subset or even a single 
stack of branes, the search algorithm can work recursively by successively impos- 
ing more Standard Model features. It would be interesting to know whether the 
Standard Model search can be performed by an algorithm which is polynomial. 

4 Statistical distributions 

4.1 Effect of the K-theory constraints 

Using the explicit constructions obtained by the computer algorithm, it is possible 
to quantify the effect of imposing the K-theory constraints (j2^- The overall 
effect of the additional constraints is to suppress the number of possible brane 
configurations by a factor of five. For example in the case without flux considered 
in this article, i.e. L = 8, and for complex structures Uj = IV/ and non-tilted 
tori, we get approximatively 1.3 x 10® models if we do not impose the constraints 
and ca. 2.3 x 10^ models enforcing them. 

Because the distribution of models is highly dominated by configurations 
where the value for is or 1 (in agreement with the observation that only 
1.6% of all models live on tilted tori), one expects that the constraints of equa- 
tion ()2H1 suppress models with odd total rank. This is indeed the case as can 
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be seen in figure |3(a)| Note that the distribution shown is hmited to the case 
of all complex structures chosen as f// = 1, but different values of the complex 
structure change only the maximum and width of the distribution, not its shape, 
which is therefore generic. 

By contrast, other observables are more or less unaffected by the inclusion 
of K-theory constraints, for example the frequency of models containing at least 
one gauge group f/(M), as shown in figure |3(bj| (as above the distribution for 
one specific value of the complex structures is generic). This is due to the fact 
that the constraints do not limit the individual brane configurations but only 
the overall composition of them to form a consistent model. In both cases the 
super symmetric brane configurations are dominated by stacks of a small number 
Na of branes. 

Let us compare the overall suppression of models by the K-theory constraints 
to the analogous results for Gepner Models [26,48]. Unfortunately, a direct com- 
parison of our gauge statistics and the Gepner Model data is not possible, since 
the abundance of solutions at the Gepner point forced the authors to restrict 
their search a priori to Standard Model like vacua. The effects of implementing 
also the above K-theory constraints for this class of vacua are analysed in [48], 
resulting only in a practically negligible suppression factor. The discrepancy is 
even more striking if one considers that the number of symplectic branes and 
therefore of additional K-theory constraints in many, though not in all of the 
Gepner orientifolds is much higher than on our toroidal orbifold. However, the 
MSSM-like models analysed on the Gepner side satisfy very strong constraints, 
in particular the U{1)y massless condition (cf. equation ((Zj)). We have reason to 
infer that these requirements a priori rule out most of those models which violate 
the K-theory constraints. 

4.2 Correlations between the total rank of the gauge group 
and chirality 

We find a correlation between the rank of the gauge group and the mean chirality 
of the model, which we have defined as 

X =< la'b - Lb >= b-' < 2fJ, >= Yl b-'Pa^b- (26) 

a,be{l,..,k} 

b<a 

In this respect the explicit computer search confirms the results obtained via a 
saddle point approximation in [28]. 

As can be seen in figure 01 the K-theory constraints do not change the overall 
shape of the distribution except for a suppression of models with odd total rank, 
as already seen in figure |3(a)j 
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4.3 Statistics of Standard-like models 



An analysis of the possible realizations of models with MSSM-like features, as 
described in section I2.4| shows a surprising result: We do not encounter any 
three-generation Standard Model in our analysis. As displayed in figure |5(a)| the 
only configurations we have found exhibit one, two or four generations, with a 
strong statistical domination of the one-generation models. 

Given the fact that models with three generations have been constructed 
explicitly in our setup (see e.g. [49,50]) this might appear as an incorrect result at 
first sight. The important point to notice here is that all three-generation models 
known to us share the property that they need values for the complex structures 
which in our conventions are very large and out of the reach of our computer 
analysis. Another issue not to be neglected is that most of the models considered 
in the literature make use of some special features like brane recombination, brane 
splitting (breaking of higher rank gauge groups) etc. From this observation and 
the arguments in section l!m we might draw the conclusion that three-generation 
models are statistically very highly suppressed in this specific orbifold setup. 

4.4 Massless hypercharge 

To gain some further insight into the distribution of Standard Model-like prop- 
erties, we have performed an analysis of our models that includes all constraints 
for MSSM-like models except for the condition of having a massless hypercharge 
U{1)y- Furthermore we allow for a different number of generations in the quark 
and lepton sector. The resulting distribution for different numbers of quark and 
lepton generations in shown in figure El 

From the phenomenological point of view these models are of course not ex- 
tremely useful because the massless f/(l)y condition is important for a consistent 
MSSM structure, but the result is nevertheless quite interesting, as it turns out 
that even within this relaxed framework no three generation model can be found. 
The closest we can get to a three-generation model is a configuration with three 
generations of quarks and four lepton generations. 

4.5 Pati-Salam models 

In addition to the direct realisation of MSSM models we also analysed the occur- 
rence of models with a Pati-Salam gauge group 

SU{4:) X SU{2)l X SU{2)n. (27) 

As can be seen in figure |5(b)j we get one-, two-, four-, six- and eight-generation 
models, but again no model with three generations. Also in this case there exist 
explicit constructions of three-generation Standard Models in the literature [50] 
and our conclusions are the same as in section 117^ - the statistical suppression of 
three generation models is extremely large. 
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4.6 Statistics of the hidden sector 



An interesting question in a statistical context concerns the properties of the 
hidden sector of our models. In figures [7(a)| and [7(b) | the distribution of the total 
rank of the gauge group and the frequency of models containing a U{M) gauge 
factor in the hidden sector are compared with the results one obtains from an 
analysis of all models considered. 

As it turns out the distributions of the constrained models differ only very 
little from those of the full set. The small deviations which are visible in the 
plots cannot be regarded as indications of fundamentally different behaviour. 
They must be seen rather as a remnant of the statistical analysis, which looses 
its applicability in the region of very high rank of the gauge groups where the 
number of models is very small as compared to the region of lower rank. 

This suggests that the statistical distributions are not affected by the addi- 
tional constraints of requiring a specific configuration in the visible sector. This 
observation gives some hints about the correlation of variables, which will be 
explored in greater detail in the section El 

A comparison of the distribution of the total dimension of the gauge group 
in the hidden sector of Standard Model-like configurations with the results for 
Gepner models in [8] shows two quite similar pictures. In figure |8(a)1 we give the 
complete distribution, in |8(b)| we allow for maximally three branes in the hidden 
sector in order to make the result better comparable to [8], where a similar cutoff 
was imposed for computational reasons. The main difference comes from the fact 
that we have much fewer models in our ensemble and, more importantly, our 
models are not constrained to three generations. 



4.7 Gauge couphngs 

The observables considered so far all belong only to the topological sector of 
the model in that they are defined by the wrapping numbers of the branes and 
manifestly independent of the geometric moduli of the compactification manifold. 
A quantity which is relatively easy to analyse but which does contain geometrical 
information are the gauge couplings in the visible sector. In [51] it was argued 
that a MSSM model built with intersecting branes naturally leads to a relation 
between the coupling constants, 

12 11 
— = 7T- + — , (28) 

where ay/s/w are the couplings of the hypercharge, the strong and weak sector 
respectively. Note that this relation of the coupling constants does not imply 
that these models automatically exhibit full gauge unification, but it could fit 
into an SU{5) framework. 

For an honest check of this relation we would have to use the renormalization 
group equations and evolve the coupling constants from the string scale down to 



17 



their low energy values. In our analysis we have not done so, but investigated 
instead the relation between the couplings at the string scale. Therefore we can 
only get some hints towards the actual relation at lower scales. 

To calculate the gauge couplings we use the expression derived in [51] for the 
gauge coupling of a stack a in an intersecting brane model, which reads in our 
conventions 

' ' , ' . j^X^Ui. (29) 



The constant c is given by 



1 M, 



Planck 



- 2V/2 M, '^»> 

and Ha is 1 for an U{N) stack and 2 for a S0{2N) or Sp{2N) stack. Note the 
explicit dependence on the complex structures. 

For the coupling of the hypercharge ay, we have to consider the contribution 
from all stacks of branes in the visible sector and distinguish the three different 
possibilities we consider for a MSSM configuration, as explained in section 12.41 
Using 

Y^2N,x^-, (31) 



1 

ay 



we get for the three configurations 



1 

ay 



1. 

2. 

3. 



1 1 

6 aa 


1 1 
2ac 


1 1 

2 ad 


2 1 


1 

+ — , 

ab 




3 aa 




2 1 


1 

+ — + 2 

ab 


1 


3 aa 


ad 



(32) 



In figure inithe distribution of the ratio to a^ is displayed. We find that only 
for 2.75% of all models ag = a^ at the string scale and that the weak coupling 
constant is generically larger than the strong one. 

Figure imi shows the different values for sin^ 9, depending on the ratio of as/ 



The red line represents the ratio given in 



sin^^ 



sin 9 is calculated from 



ay 



ay + a^ 



(33) 



We find that 88% of the models obey the relation (j2HI)- Note that this result 
might be a bit obscured by the plot because it shows one dot for every possible 
value, not taking into account that there exist many models with exactly the 
same values. In fact there exist many more models with low values for as/a^ 
and sin^ 9, which happen to be the ones fulfilling the relation. 
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These results can also be compared with the analysis of [8] , where as in the case 
of the hidden gauge group one has to take into account that we are dealing with 
a smaller ensemble and with models that are not constrained to exactly three 
generations of fermions. The fraction of MSSM-like Gepner models satisfying 
(I28|l is found there to be only ca. 10%. Without wanting to over-interpret this 
mismatch, we emphasize once more that the gauge couplings are not part of the 
topological sector of the theory and that therefore deviations between the small 
and large radius regime fit into the general picture. 



5 Correlations 

In this section we investigate the correlation between different observables in 
greater detail. As analysed in section 14. 6| the statistics of the hidden sector of 
MSSM-like and Pati-Salam models are, on a statistical basis, very similar to the 
full statistics of all models. The same is also true for the non-trivial correlations 
between observables in our models. This is shown in figure ^2 for the example 
of the correlation between mean chirality and total rank of the gauge group, as 
discussed in section 14.21 

Putting these hints together we might conclude that the 'atomic' frequency 
distributions actually do not depend on the specific kind of models chosen (e.g. 
MSSM-like, Pati-Salam or something else), but only on a generic distribution 
which is already present in the completely unconstrained setup. 

To make this statement more quantitative, we calculate the frequencies for 
some atomic properties important for model building. In particular, we check 
whether two properties A and B are statistically independent in the ensemble of 
D-brane models. A good measure for statistical dependence is given by 

PiA)xPiB)-PiAAB) 

P{A)x P{B) + P{AAB)' ^ ' 

As properties we choose the appearance of models with at least one U (3) or 
U{2)/ Sp{2) gauge factors or with at least one stack without symmetric represen- 
tations. 

The results are presented in figure ^1 depending on the number of stacks 
of the models under consideration. Note that the frequency of finding U{1) 
stacks is close to one and therefore does not lead to any further suppression 
for the appearance of a MSSM like model. The plots demonstrate that, for 
sufficiently large numbers of stacks, the considered properties are statistically 
independent and that therefore their frequencies simply multiply. Note that this 
was an assumption in the statistical estimates carried out in the original work [10]. 
The combined values (integrated over all models) for the correlation between the 
properties of an MSSM-like model (without restriction to massless f/(l)y) is 
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0.094, which is reasonably low to justify considering the individual properties as 
essentially independent. 

5.1 Estimate of three generation Standard Models 

Having found that the constraints are statistically only very little correlated we 
can make some predictions about the overall probability of configurations in our 
setup which do not show up in the data like for example true Standard Models. 
In table |3] we summarize the factors coming from the various constraints. The 
two U{1) gauge groups required for our Standard Model setup are not included, 
because, as mentioned, the frequency of having one of them is essentially one. In 
addition we give the fraction of models - among all configurations containing a 
U{3) X U{2) factor without symmetric representations - which meet in addition the 
individual constraints of a massless hypercharge, of three generations of quarks 
or of three lepton families. 



Restriction 


Factor 


gauge factor f/(3) 

gauge factor U{2)/Sp{2) 

No symmetric representations 

Massless U{1)y 

Three generations of quarks 

Three generations of leptons 


0.0816 
0.992 
0.839 
0.423 
2.92 X 10"^ 
1.62 X 10-3 


Total 


1.3 X 10^9 



Table 4: Suppression factors for various constraints of Standard Model properties. 

Multiplying all these factors leads to an overall suppression factor of 

1.3 X 10-^ (35) 

i.e. only one in a billion models gives rise to a four stack D-brane vacuum with 
Standard Model gauge symmetry and three generations of quark and leptons. 
Multiplying this with the total number of models analysed, ~ 1.66 x 10®, 
leaves us with 0.21 true Standard Models in our ensemble. 

5.2 How good is this estimate? 

To get some idea about the quality of this estimate we can apply the method 
used for three generation models to Standard Model-hke solutions we actually 
did find in our analysis. The result of this computation for two and four family 
models is given in table This check clearly shows that our estimate gives the 
right order of magnitude of the number of expected solutions, but the precise 
value might differ by several standard deviations. 
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# generations 


# of models found 


estimated # 


suppression factor 


2 


162921 


188908 


10-3 


3 





0.2 


^ 10-9 


4 


3898 


3310 


fa 2 X 10-^ 



Table 5: Comparison between the estimated number of solutions and the actual 
number of solutions found for models with two, three and four generations. 

6 Conclusions 

In this work we have given an explicit statistical analysis of the vacuum structure 
in a specific example of Type 11 orientifold models. The bottom line is that a 
Standard Model-like configuration with three families of quarks and leptons in 
this class of models is statistically highly suppressed. The concrete models which 
have so far been constructed in this setup can therefore be regarded as exceptional 
within the vast majority of possible solutions. The same holds also for models 
with a Pati-Salam gauge group. 

A statistical analysis of the different observables shows that there exist non- 
trivial correlations between some of them such as the rank of the gauge group 
and the chirahty of the models. Interestingly, these correlations hardly depend 
on whether we analyse them in the full set of solutions or only in the hidden 
sector of some specific visible configuration. In that respect, the hidden sectors 
of a Standard Model or Pati-Salam construction exhibit universal behavior and 
we expect this to hold for any other visible sector as long as the number of 
constraints it imposes is comparable. 

Despite this general occurrence of correlations, we can quantitatively justify 
the hypothesis that the basic properties of Standard Model-like constructions 
are to sufficiently good accuracy independent. Under this assumption, we derive 
an estimate for the relative frequency of three-generation Standard Models in 
our setup which is of the order of 10-^. Of course, by requiring more of the 
phenomenological features of the Standard Model such as Yukawa couplings, 
gauge couplings, soft supersymmctry breaking terms, the occurrence of realistic 
models will get further reduced. However, these properties really depend on the 
finer details of the models (see for instance [52-55]) and an honest statistical 
treatment of them is much harder to carry out. 

An important question is how generic our results actually are. Even though 
the available data for both frameworks are only partially compatible, we have 
performed a rough comparison with the statistics of the hidden sector of MSSM- 
like Gepner model orientifolds [26] where possible. The geometrical properties 
of these small radius constructions differ considerably from those of the large 
radius regime. However, the results seem to suggest that the observables of the 
topological sector exhibit quite similar distributions. This is not so surprising as 
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the diophantine structure of the consistency conditions describing this sector is 
essentially the same for both frameworks and since, after all, topological quan- 
tities should be protected against too drastic changes as one varies the coupling 
constants or radii. In fact, we regard this as strong hints towards a universal 
behaviour in the distribution of the topological observables also in other string 
constructions. By contrast, the distribution of the gauge couplings, which are 
dependent on the geometric moduli of the theory, do differ. It would be very 
desirable to collect more evidence supporting this pictTirc. 

For this purpose it is important to study the statistics of other quite well 
understood classes of models like for instance heterotic string compactifications on 
elliptically fibered Calabi-Yau manifolds (see for instance [3,56-58]). Questions 
in this direction include: Does the distribution of the rank of the gauge group 
qualitatively have the same shape? What is the abundance of Standard like 
models? In addition, as we started to investigate in our first paper [28], it would 
be interesting to combine the T-dual picture of magnetised branes with additional 
three-form fluxes [49, 59-61] and determine the effect on the statistics. 
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Figure 1: The number of different configurations of for L = 2, satisfying the 
constraints. The x-axis shows combinations of complex structures in an arbitrary 
scale. 
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Figure 2: The number of models for L = 8, plotted against the absolute value of 
U. The highest analysed value of U is 12. 
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Figure 3: Probability distributions for L = 8,Ui = 1. The stars represent the 
distribution without K-theory constraints, the boxes give the result including the 
constraints. 
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(a) with K-theory constraints 




(b) without K-theory constraints 

Figure 4: The frequency distribution of models of specific rank and chirality. 
L = 8,^7/ = 1. 
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Figure 5: Logarithmic plot of the number of models versus the number of gener- 
ations. 



26 



Log(# models) 




Figure 6: Logarithmic plot of the number of models with Standard Model char- 
acteristics, not including the condition for a massless hypercharge, depending on 
the number of generations in the quark {Ql) and lepton (L) sector. 
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Figure 7: Frequency distributions of all models (empty black boxes) and in the 
hidden sector of MSSM models (red stars), MSSM models without restriction to 
a massless U{1) (green diamonds) and Pati-Salam models (blue triangles). 
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(a) Full set of models 




(b) Restricted set with max. three branes in the hidden sector 

Figure 8: Logarithmic plots of the number of models with Standard Model gau 
group and massless U{1) depending on the dimension of the gauge group G 
the hidden sector. 
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Figure 9: Distribution of — in Standard Model-like configurations. 
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Figure 10: Values of sin^O depending on ^ (on a logarithmic scale). Each dot 
represents a class of models with these values. The red line shows the relation 
(OHjl . which is satisfied for 88% of all models 
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(a) MSSM-like models 




(b) MSSM-like models with massive 17(1) 




(c) Pati-Salam models 

Figure 11: The relative frequency to find models of specific rank and chirality, 
requiring different visible sectors to be present in the models. 
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(a) Pc = |-P(3)*P(2/Sp2)-P(2/Sp2A3)|/C 
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Figure 12: Correlations between atomic properties of MSSM-like configurations. 
The factor C is a normalization, see equation (jM)) . Relative frequencies are 
labeled as follows: P(M): frequency of stacks with gauge group U (M); P{2/Sp2): 
frequency of U{2) or Sp{2) stacks; P{ns): frequency of gauge groups without 
symmetric representations. 
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